Dynamic Characteristics of Multithreaded Execution in the EM - X Multiprocessor
نویسندگان
چکیده
Multithreading is known be e ective for tolerating communication latency in distributed-memory multiprocessors. Two types of support for multithreading have been used to date including software and hardware. This paper presents the impact of multithreading on performance through empirical studies. In particular, we explicate the performance di erence between software support and hardware support for the 80-processor EM-X distributed-memory multiprocessor which we have designed and implemented. The EMX provides three types of hardware supports for ne-grain multithreading including direct remote memory access, fast thread invocation, and dedicated instructions for generating xed-sized communication packets. To demonstrate the e ect of multithreading, we have performed various experiments using micro benchmark programs and MP3D, one of the SPLASH benchmarks. Three types of performance parameters have been measured including processor e ciency, remote memory latency, and network load. Experimental results indicate that the EM-X architecture is highly e ective for supporting the multithreading principles of execution through dedicated hardware and software. keywords Multithreading, latency hiding, ne grain communication, direct remote memory access, shared memory benchmark, synthetic workload.
منابع مشابه
Design and Evaluation of Dynamic Load Balancing Schemes under a Fine-grain Multithreaded Execution Model
The evolution of computer systems based on fine-grain multithreaded program execution models introduces both unique opportunities and tough challenges for the support of dynamic load balancing. Although load balancing is an active research topic in the distributed computing field, there is still a lack of a detailed study of the different dynamic load balancing strategies under a fine-grain mul...
متن کاملAnalysis of Performance Bottlenecks in Multithreaded Multiprocessor Systems
The performance of modern multiprocessor systems is often limited by the delays of interconnections or long latencies of memory subsystems. Instruction–level multithreading is a technique to tolerate such long latencies by switching from one instruction thread to another and continuing instruction execution concurrently with the long–latency operations. Using timed Petri net models, the paper a...
متن کاملAnalysis of Performance Limitations in Multithreaded Multiprocessor Architectures
The performance of modern multiprocessor systems is increasingly limited by interconnection delays or long latencies of memory subsystems. Instruction–level multithreading is a technique to tolerate such long latencies by switching from one instruction thread to another and continuing instruction execution concurrently with the long–latency operations. Using timed Petri net models, the paper an...
متن کاملFrom EARTH to HTMT: An Evolution of A Multithreaded Architecture Model
In this talk, we discuss the issues and challenges solving irregularly structured problems in parallel from the angle of system architectures and support To this end, multithreaded architecture models and systems provide an new opportunity for meeting such challenges. We begin by a brief review on the evolution of multithreaded models and architectures | in particular the EARTH (E cient Archite...
متن کاملPerformance Impact of Multithreaded Java Semantics on Multiprocessor Memory Consistency Models
The semantics of Java multithreading dictates all possible behaviors that a multithreaded Java program can exhibit on any platform. This is called the Java memory model and describes the allowed re-orderings among the operations in a thread. However, multiprocessor platforms traditionally have a memory consistency model of their own. Consequently memory barriers may have to be inserted to ensur...
متن کامل